Protein Sequence Grouping by Peptide Word Motifs
Abstract
Methods for collecting related segments from the protein sequence database, using strongly conserved peptide words as well as sequence homology, were applied to the problem of reconstructing the PROSITE catalog [1] from the sequence database. In many cases our results were consistent with PROSITE, although some additional relationships were also found.
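As a rough illustration of grouping by shared peptide words, the sketch below indexes every overlapping word of a fixed length and keeps the words that occur in more than one sequence. The abstract does not state the word length, the conservation criterion, or how sequence homology is combined with the word matches, so the function names, the parameters `k` and `min_sequences`, and the toy sequences are all assumptions made for this example and not the authors' procedure.

```python
from collections import defaultdict


def peptide_words(sequence, k):
    """Yield (offset, word) for every overlapping k-residue word."""
    for i in range(len(sequence) - k + 1):
        yield i, sequence[i:i + k]


def group_by_shared_words(sequences, k=4, min_sequences=2):
    """Map each word to the (sequence id, offset) pairs where it occurs.

    Only words found in at least `min_sequences` distinct sequences are kept.
    Both thresholds are illustrative assumptions, not values from the paper.
    """
    occurrences = defaultdict(list)
    for seq_id, seq in sequences.items():
        for offset, word in peptide_words(seq, k):
            occurrences[word].append((seq_id, offset))
    return {
        word: hits
        for word, hits in occurrences.items()
        if len({seq_id for seq_id, _ in hits}) >= min_sequences
    }


# Toy sequences invented for illustration; they are not database entries.
toy = {
    "seqA": "MKTGHGKSTLA",
    "seqB": "AAGHGKSQ",
    "seqC": "MLPWWRT",
}
groups = group_by_shared_words(toy, k=4)
for word, hits in groups.items():
    print(word, hits)
```

Each retained word points at the segments that contain it, which could then be extended or compared by a homology measure in a later step.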
Similar resources
Arabidopsis nuclear-encoded plastid transit peptides contain multiple sequence subgroups with distinctive chloroplast-targeting sequence motifs.
The N-terminal transit peptides of nuclear-encoded plastid proteins are necessary and sufficient for their import into plastids, but the information encoded by these transit peptides remains elusive, as they have a high sequence diversity and lack consensus sequences or common sequence motifs. Here, we investigated the sequence information contained in transit peptides. Hierarchical clustering ...
3MATRIX and 3MOTIF: a protein structure visualization system for conserved sequence motifs
Computational methods such as sequence alignment and motif construction are useful in grouping related proteins into families, as well as helping to annotate new proteins of unknown function. These methods identify conserved amino acids in protein sequences, but cannot determine the specific functional or structural roles of conserved amino acids without additional study. In this work, we prese...
Evaluating the Effect of Protein A Signal Peptide on Extracellular Expression of Recombinant Hirudin in E.Coli
Abstract Background and Objective: Hirudin is an anticoagulant polypeptide secreted from the salivary glands of leeches. Recombinant hirudin is a strong anticoagulant agent in arterial and venous thrombosis. The aim of this study was to evaluate the effect of inserting protein A signal peptide sequence of pEZZ18 plasmid on expression and secretion...
Discovering Protein Function Classification Rules from Reduced Alphabet Representations of Protein Sequences
The paper explores the use of reduced alphabet representations of protein sequences in the data-driven discovery of sequence motif-based decision trees for classifying protein sequences into functional families. A number of alternative representations of protein sequences (using a variety of reduced alphabets based on groupings of amino acids in terms of their physico -...
Probabilistic analysis of the frequencies of amino acid pairs within characterized protein sequences
Here, we describe a unique probabilistic evaluation of the 20, naturally occurring, amino acids and their distributions within the Swiss-Prot and Complete Human Genebank databases. We have developed a computational technique that imparts both directionality and length constraints into searches for unique combinations of amino acids within protein sequences. Using statistical approaches, we have...